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Abstract 

In this paper, we propose a partial interference cancellation (PIC) group decoding for linear disper- 
sive space-time block codes (STBC) and a design criterion for the codes to achieve full diversity when 
the PIC group decoding is used at the receiver. A PIC group decoding decodes the symbols embedded 
in an STBC by dividing them into several groups and decoding each group separately after a linear PIC 
operation is implemented. It can be viewed as an intermediate decoding between the maximum likelihood 
(ML) receiver that decodes all the embedded symbols together, i.e., all the embedded symbols are in 
a single group, and the zero-forcing (ZF) receiver that decodes all the embedded symbols separately 
and independently, i.e., each group has and only has one embedded symbol, after the ZF operation 
is implemented. The PIC group decoding provides a framework to adjust the complexity-performance 
tradeoff by choosing the sizes of the information symbol groups. Our proposed design criterion (group 
independence) for the PIC group decoding to achieve full diversity is an intermediate condition between 
the loosest ML full rank criterion of codewords and the strongest ZF linear independence condition of 
the column vectors in the equivalent channel matrix. We also propose asymptotic optimal (AO) group 
decoding algorithm which is an intermediate decoding between the MMSE decoding algorithm and the 
ML decoding algorithm. The design criterion for the PIC group decoding can be applied to the AO 
group decoding algorithm too. It is well-known that the symbol rate for a full rank linear STBC can be 
full, i.e., n t for n t transmit antennas. It has been recently shown that its rate is upper bounded by 1 if 
a code achieves full diversity with a linear receiver. The intermediate criterion proposed in this paper 
provides the possibility for codes of rates between n t and 1 that achieve full diversity with a PIC group 
decoding. This therefore provides a complexity-performance-rate tradeoff. Some design examples are 
given. 
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I. Introduction 

MIMO technology is an important advancement in wireless communications since it offers 
significant increase in channel capacity and communication reliability without requiring addi- 
tional bandwidth or transmission power. Space-time coding is an effective way to explore the 
promising potential of an MIMO system. In the coherent scenario, where the channel state 
information (CSI) is available at the receiver, the full rank design criterion is derived in [13], 
[37] to achieve the maximum diversity order in a quasi-static Rayleigh fading channel. However, 
the derivation of the full rank criterion is based on the assumption of the optimal decoding at 
the receiver. In order to achieve the maximum diversity order, received signals must be decoded 
using the maximum likelihood (ML) decoding. Unfortunately, the computational complexity of 
the ML decoding grows exponentially with the number of the embedded information symbols 
in the codeword. This often makes the ML decoding infeasible for codes with many information 
symbols embedded in. Although near-optimal decoding algorithms, such as sphere decoding or 
lattice-reduction-aided sphere decoding, exist in the literature, [4], [5], [26], [27], [42], their 
complexities may depend on a channel condition. 

In order to significantly reduce the decoding complexity, one may decode one symbol at 
a time and make the decoding complexity grow linearly with the number of the embedded 
information symbols. This can be achieved by passing the received signals through a linear 
filter, which strengthens a main symbol and suppresses all the other interference symbols and 
then one decodes the main symbol from the output of the filter. By passing the received signal 
through a filter bank, one can decode each symbol separately. There are different criteria to 
strengthen the main symbol and suppress the interference symbols. If the filter is designed to 
completely eliminate the interferences from the other symbols, we call such decoding method 
zero-forcing (ZF) or interference nulling decoding. If the filter is designed according to the 
minimum mean square error (MMSE) criterion, we call the decoding method MMSE decoding. 
The well known algorithms with the above idea are BLAST-SIC algorithms [45]. Since these 
symbol-by-symbol decoding methods may not be ML but only suboptimal, the full rank criterion 
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can not guarantee the codes to achieve the maximum diversity order. In some special cases, the 
symbol-by-symbol decoding is equivalent to the ML decoding and thus the full rank property 
ensures the codes achieve full diversity in these cases. The first such a code is the Alamouti 
code for two transmit antennas [1]. The orthogonal structure of the Alamouti code ensures that 
symbol-by-symbol decoding is equivalent to the ML decoding. The Alamouti code has inspired 
many studies on orthogonal STBC (OSTBC) [22], [23], [25], [36], [38], [44]. However, OSTBC 
suffers from a low symbol rate. In [44], it has been proved that the symbol rate of an OSTBC 
is upper-bounded by 3/4 with or without linear processing among the embedded information 
symbols or their complex conjugates for more than 2 transmit antennas and conjectured that it 
is upper bounded by ^ for 2k — 1 and 2k transmit antennas, where A: is a positive integer (this 
upper bound was shown in [22] when no linear processing is used among information symbols). 
Explicit designs of OSTBCs with rates achieving the conjectured upper bound have been given in 
[22], [25], [36]. Note that the rate only approaches to 1/2 when the number of transmit antennas 
goes large. For a general linear dispersion STBC [14], [15] that has no orthogonal structure, 
the full diversity criterion for STBC decoded with a symbol-by-symbol decoding method has 
not been discovered until recently. In [48], Zhang-Liu-Wong proposed a family of STBC called 
Toeplitz codes and proved that a Toeplitz code achieves full diversity with the ZF receiver. The 
symbol rate of a Toeplitz code approaches 1 as the block length goes to infinity. Later in [32], 
Shang-Xia extended the result in [48] and proposed a design criterion for the codes achieving 
full diversity with ZF and MMSE receivers. They also proposed a new family of STBC called 
overlapped Alamouti codes ( OAC), which has better performance than Toeplitz codes for any 
number of transmit antennas. The symbol rate of an OAC also approaches to 1 as the block 
length goes to infinity. It has been proved in [32] that the symbol rate of an STBC achieving 
full diversity with a linear receiver is upper-bounded by I. Simulation results in [32] show that 
OAC outperform OSTBC for over 4 transmit antennas. Note that it is shown in [32] that for any 
OSTBC, its MMSE/ZF receiver is the same as the ML receiver. 

Although OSTBCs can be optimally decoded in a symbol-by-symbol way, the orthogonality 
condition is too restrictive as we mentioned above. From an information theoretical point of 
view, this can cause a significant loss of channel capacity [30]. By relaxing the orthogonality 
condition on the code matrix, quasi-orthogonal STBC (QOSTBC) was introduced by Jafarkhani 
in [16], Tirkkonen-Boariu-Hottinen in [40] and Papadias-Foschini in [30] to improve the symbol 
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rate at the tradeoff of a higher decoding complexity. The basic idea of QOSTBC is to group the 
column vectors in the code matrix into pairs and keep the orthogonality among the groups of 
the column vectors while relax the orthogonality requirement within each group. Because of this 
partial orthogonality structure, QOSTBC can be (ML) decoded pair-by-pair complex symbols, 
which has a higher decoding complexity compared to the OSTBC. The original QOSTBCs do 
not possess the full diversity property. The idea of rotating information symbols in a QOSTBC 
to achieve full diversity and maintain the complex symbol pair-wise ML decoding has appeared 
independently in [34], [35], [39], and the optimal rotation angles 7r/4 and 7r/6 of the above 
mentioned information symbols for any signal constellations on square lattices and equal-literal 
triangular lattices, respectively, have been obtained in Su-Xia [35] in the sense that the diversity 
products (coding gains) are maximized. In [19], [43], [46], the authors further studied QOSTBC 
with minimum decoding complexity. The underlying constellation is assumed to be rectangular 
QAM, which can be viewed as two PAM constellations. The minimum decoding complexity 
means the code can be optimally decoded in a real-pair-wise way. Compared to the complex- 
pair-wise decodable QOSTBC, the decoding complexity of real-pair-wise decodable QOSTBC is 
lower. In [7], [17], [18], [21], [43], [47], the pair-by-pair decoding was generalized to a general 
group-by-group decoding. The symbols in a code matrix are separated into several groups and 
each group is decoded separately. With the help of graph theory, a rate | code was obtained in 
[47] that can be decoded in two groups, each group contains 5 real symbols. In [17], [18], a 
Clifford algebra approach is applied for multi-group decodable STBCs. 

In this paper, we propose a general decoding scheme called partial interference cancellation 
(PIC) group decoding algorithm for linear dispersion (complex conjugated symbols may be 
embedded) space-time block codes (STBC) [14], [15]. A PIC group decoding decodes the 
symbols embedded in an STBC by dividing them into several groups and decoding each group 
separately after a linear PIC operation is implemented. It can be viewed as an intermediate 
decoding between the ML receiver that decodes all the embedded symbols together, i.e., all 
embedded symbols are in a single group, and the ZF receiver that decodes all the embedded 
symbols separately and independently, i.e., each group has and only has one embedded symbol, 
after the ZF operation is implemented. The PIC group decoding provides a framework to adjust 
the complexity-performance tradeoff by choosing the sizes of the information symbol groups. 
It contains the previously studied decoding algorithms for codes such as OSTBC [1], [38], 
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QOSTBC [16], [19], [34], [35], [40], [43], [46], and STBC achieving full diversity with linear 
receivers [32], [48] as special cases. We propose a design criterion for STBC achieving full 
diversity with the PIC decoding algorithm. Our proposed design criterion is an intermediate 
criterion between the loosest ML full rank criterion [13], [37] of codewords and the strongest 
ZF linear independence criterion of the column vectors in the equivalent channel matrix [32]. 
We then propose asymptotic optimal (AO) group decoding algorithm which is an intermediate 
decoding between the MMSE decoding algorithm and the ML decoding algorithm. The design 
criterion for the PIC group decoding can be applied to the AO group decoding algorithm because 
of its asymptotic optimality. It is well-known that the symbol rate for a full rank linear STBC 
can be full, i.e., n t for n t transmit antennas. It has been recently shown in [32] that its rate is 
upper bounded by 1 if a code achieves full diversity with a linear receiver. The intermediate 
criterion proposed in this paper provides the possibility for codes of rates between n t and 1 
that achieve full diversity with the PIC group decoding. This therefore provides a complexity- 
performance-rate tradeoff. Design examples of STBC achieving full diversity with the PIC group 
decoding are finally presented. Our simulations show that these codes can perform better than 
the Alamouti code for 2 transmit antennas and the QOSTBC with the optimal rotation for 4 
transmit antennas. Note that a similar algorithm and an STBC design have been proposed lately 
in [28] but they do not achieve full diversity. 

This paper is organized as follows. In Section II, we describe the system model; in Section 
III, we propose the PIC group decoding algorithm, its connection with ZF decoding algorithm 
and the corresponding successive interference cancellation (SIC) aided decoding algorithm or 
PIC-SIC; In Section IV, we systematically study the diversity property of the codes decoded 
with the PIC group decoding and the PIC-SIC group decoding, and derive the design criterion. 
In Section V, we propose AO group decoding. In Section VI, we present two design examples. 
In Section VII, we present some simulation results. 

Some notations in this paper are defined as follows. 

• C: complex number field; 

• R: real number field; 

• A: a signal constellation; 

• tr: trace of a matrix; 

• Bold faced upper-case letters such as A represent matrices; 
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• Bold faced lower-case letters such as x represent column vectors; 

• Superscripts T , H , *: transpose, complex conjugate transpose, complex conjugate, respec- 
tively; 

• ||-||: Z 2 -norm for a vector; 

• \\-\\ F : Frobenius norm for a matrix; 
. i : 

II. System Model 

We consider a quasi-static Rayleigh block-fading channel with coherence time t. Assume there 
are n t transmit and n r receive antennas. The channel model is written as follows, 



*™HX + W, (!) 

n t 

where Y = (yij) E C nrXt is the received signal matrix that is received in t time slots, H = 
(hij) G C nrXn * is the channel matrix, the entries of H are assumed i.i.d. with distribution 
£/V(0, 1), X e C ntX * is the codeword matrix that is normalized so that its average energy is 
tn t , i.e., 

tr (£ {X H X}) = tn t , 

W E C" rXi is the additive white Gaussian noise matrix with i.i.d. entries w iy j ~ £/V(0, 1), SNR 
is the average signal-to-noise-ratio (SNR) at the receiver. 

In this paper, we only consider linear dispersion STBC, which covers most existing STBCs, 
[14]: 

n-l 

X = J2-i;A l + .r > i B„ (2) 

i=0 

where x { e A,i = 0, 1, . . . , n — 1, are the embedded information symbols, A is a signal 
constellation, Ai,Bi e C n ' xt , % — 0, 1, . . . , n—1, are constant matrices called dispersion matrices. 
We use X to denote the codebook, i.e., 

X= jx = Y^XiAi + xtB^Xi e A,i = 0,l,...,n- lj . (3) 

For convenience, we also use X to denote the coding scheme that is associated with the codebook. 
In order to apply a linear operation, the system model in (1) needs to be rewritten as 

y = VSNRGx + w, (4) 
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where y G C" r is the received signal vector, G G C*" rXn is an equivalent channel ma- 
trix [14], [15], [32]; x = [x , x 1: . . . , x n _i] T G -4 n is the information symbol vector; w = 
[w ,Wi, . . . ,w trir ] T G C tflr is the additive white Gaussian noise, w,i ~ CJ\f(0, 1). For many (if 
not all) existing linear dispersion (or linear lattice) STBCs, such as those in [1], [2], [9]-[ll], 
[15], [16], [20], [29], [31], [32], [35], the channel model can be rewritten in the form of (4). 
One simple observation is that for a linear dispersion STBC that is defined as 



Tl-1 



(5) 



i=0 



which is a special case of the linear dispersion STBC in (2), the channel model can always be 
written in the form of (4). All the codes in [2], [9]-[ll], [15], [20], [29], [31] fall into this 
category. Another case in which the channel model can be rewritten in the form of (4) is that 
each column of X contains linear combinations of either only x^i = 0,l,...,n — 1 or only 
x*,i = 0, 1, ... ,n — 1. Examples of such codes include the Alamouti code [1] and QOSTBCs 
[16], [35] and OAC [32]. For instance, the channel model of the Alamouti code with one receive 
antenna is 

.''0 -■> I 





/SNR 




2/o,o 2/0,1 


= V 2 


^0,0 ^0,1 



+ 



By taking unitary linear operations and conjugations, which do not change the probabilistic 
property of the white Gaussian noise, we can extract the embedded information symbol vector 
and rewrite the above channel model as follows, 

^0,0 ^0,1 



J/0,0 

2/5,1 



VSNR I -= 

V2 



h* 



-h 



o.o 





Xq 




w ,o 


) 




+ 




Xi 







(6) 



It is shown in [32] that for any OSTBC (a column may include both Xi and x* simultaneously), its 
equivalent channel (4) exists. In the case when there are multiple receive antennas, an equivalent 
channel matrix can be derived by noting that at each receiver antenna, the received signal model 
is of the same form as in (6). For example, if there are two receive antennas for the Alamouti 
code, then an equivalent channel model is 

/ 



2/o,o 
2/o,i 
2/i,o 

fix 



i 

7^ 



\ 





ho,i 


\ 




Wofi 


h* 


— h* 
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+ 


W 0,l 


hi,o 


^1,1 




Xi 




Wl,0 


h* 
"■1,1 


— h* 
a l,0. 


) 




. w ll. 
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It is not hard to see that the original channel H and an equivalent channel G satisfy the following 
property, 

\\H(X 1 -X 2 )\\ F = \\G(x 1 -x 2 )\\, (7) 

where X , . X 2 G X, X\ and x 2 are vectors of information symbols embedded in X 1 and X 2 , 
respectively. 

For a linear dispersion code with a rectangular signal constellation A, which can be viewed 
as two PAM constellations, if it does not have its equivalent channel model in (4), the channel 
model can always be written in the following form [6], [14], 



where y G 



y = VSNRG _ 

' 3va(x 

lr is the received signal vector, G G 



w, 



(8) 



2tn r X 2n 



is the equivalent channel matrix, 



w = [w ,wi, . . . , w 2 tn r } T G R 2trir is the real white Gaussian noise vector, Wi ~ A/"(0, \). The 
entries of 

' VKz{x) 
3m(x) 

can be viewed as drawn from a PAM constellation. Hence there is no essential difference between 
the models in (4) and (8) except that the noise in (8) is real. 

Note that for both channel models in (4) and (8), the entries of the equivalent channel matrix 
G are linear combinations of h i: j and h*j, < % < n r — 1, < j < n t — 1. If we use the notation 
h — [h 0: hi, . . . , = vec(H), then both (4) and (8) are special cases of the following model, 



y = VSMRG(h)x + w, 



(9) 



where G(h) G C mxn is an equivalent channel matrix, which is a function of h = [h , hi, ... , hi-i], 
hi ~ CJ\f(0, 1), x = [x , xi, . . . , x n -i] G A n is the information symbol vector, w = [w , . . . , w m -i\ 
is the additive white Gaussian noise vector. For convenience, we always assume that noise 
w is complex Gaussian, while for real Gaussian w, the derivation is exactly the same. From 
the following discussions, we shall see later that not only the channel model in (9) contains 
the equivalent channel model derived from transforming the original channel model of linear 
dispersion STBC in (1), but also it is a resulted form after each PIC operation. 
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III. PIC Group Decoding Algorithm 

In this section, we present a PIC group decoding algorithm that is, as we mentioned before, 
an intermediate decoding algorithm between the ML decoding algorithm and the ZF decoding 
algorithm, and has the ML decoding and the ZF decoding as two special cases. In the first 
subsection, we describe the PIC group decoding algorithm; in the second subsection, we discuss 
its connection with the ZF decoding algorithm; in the third subsection, we discuss the successive 
interference cancellation aided PIC group decoding algorithm (PIC-SIC); some examples are 
given in the last part of this section to illustrate the PIC group decoding algorithm. 



A. Partial Interference Cancellation Group Decoding Algorithm 

We now present a detailed description of the PIC group decoding algorithm. All the following 
discussions are based on the equivalent channel model in (9). First let us introduce some 
notations. Define index set X as 

J={0,l,2,...,ra-1}, 

where n is the number of information symbols in x. First we partition X into iV groups: 
X ,X 1? . . . ,X/v_i. Each index subset l k can be written as follows, 

Xfc = {ik,o, ik,i, ■ ■ ■ ■> ik,n k -i}, k — 0, 1, . . . , N — 1, 

where n k = \I k \ is the cardinality of the subset l k . We call X = {X ,Xi, . . . ,In-i} a grouping 
scheme, where, for simplicity, we still use X to denote a grouping scheme. For such a grouping 
scheme, the following two equations must hold, 

JV-l N-l 

X = Xj and ^ n; = n. 

i=0 i=0 

Define x Ik as the information symbol vector that contains the symbols with indices in T k , i.e., 

n T 



X ik,o 1 X ik,l i ■ ■ ■ i X ik,n k -l 



Let the column vectors of an equivalent channel matrix G(h) be <7o, <?i, • • • ,9n-i mat have size 
m x 1. Then, we can similarly define Gj k as 



G 



9ikfii 9ik,ii ' ' • ' 9ik,; 



-i] • do) 
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With these notations, equation (9) can be written as 

N-l 
i=0 

Suppose we want to decode the A;-th symbol group x Xk . Note that in the ZF decoding algorithm, 
to decode the A;-th symbol, the interferences from the other symbols are completely eliminated 
by a linear filter (the A:-th row of the pseudo-inverse matrix of the equivalent channel). The same 
idea can be applied here. We want to find a matrix (linear filter) P Xk such that by multiplying 
y by P Ik to the left (linear filtering), all the interferences from the other symbol groups can be 
eliminated. Such a matrix P Xk can be found as follows. Define Q Xk e C mxm as the projection 
matrix that projects a vector in C m to the subspace Y Xk that is defined as 

V Zfc = span{^,0 < i < n,i £ X k } . (12) 

Let Gj k G C mx< ™~™ fc ) denote the matrix that is obtained by removing the column vectors in G 
with indices in X k , i.e., 

Gi k = [Gi ,G Xl , . . . ,G Xk _^G Xk+1 , . . . ,G In _ 1 ] ■ (13) 
Then, the projection matrix Q Ik can be expressed in terms of Gj k as follows, 

Q Xh = cr^((cr Xk ) H (^y\cr Xk ) H , (U) 

where we assume Gj k is full column rank. If Gj k is not full column rank, then we need to pick 
a maximal linear independent vector group from G c Xk and furthermore, in this case the symbols 
in group X k can not be solved, which is out off the scope of this paper. Thus, we always assume 
that Gj k has full column rank. Define Pj k as 

Px k =I m -Qx k , (15) 

then P Ik is the projection matrix that projects a vector in C m onto the orthogonal complementary 
subspace Vj fe . Since the projection of any vector in Vj k onto Vj fe is a zero vector, we have 

P Ik G I% = 0, i = 0, 1, . . . , k - 1, k + 1, . . . , N - 1, (16) 

which is due to P Ik G Ik = 0. Define z Ik = Pj k y. By applying (16), we get 

N-l 

z Xk = VSNRP Xk Y] G x% x Xz + P Xk w 

i=o (17) 

= Vsmp Xk G Xk x Xk + p Xk w. 
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From (17), we can see that by passing the received signal vector y through the linear filter 
Pj k , the interferences from the other symbol groups are completely canceled and the output z Xk 
only contains the components of the symbol group x Ik . There may exist other matrices that can 
remove the components of the interference symbol groups in y. The following lemma shows 
that the linear filter matrix P Xk defined above is the best choice. 

Lemma 1. Consider the channel model in (11) and let SNR be the SNR of the system. Suppose 
we want to detect the symbol group x% k . Let V% k be the matrix set that contains all the matrices 
that can cancel the interferences from x Xi , < i < N,i ^ k, i.e., 



from ML decoding is denoted as P err {P-x k , SNR). Then for any given SNR, we always have 



where Pj k is defined as in (15). 

A proof of this lemma is given in Appendix A. 

Notice that since in our PIC group decoding, all the symbols in a group are decoded together, 
using highest SNR as the optimality may not be proper. This is the reason why in the above 
lemma, block error probability is used as the criterion for the optimality of a filter. Equation 
(17) can be viewed as a channel model in which xj k is the transmitted signal vector and z% k is 
the received signal vector. As we mentioned before in Section II, this channel model is derived 
from the interference cancellation procedure, and fits into the general channel model in (9). Note 
that in (17), the noise term Pj k w is no longer a white Gaussian noise. Despite of the presence 
of this non-white Gaussian noise term, the following lemma shows that the minimum distance 
decision is still the ML decision in this case. 

Lemma 2. Consider the channel model 




(18) 



The block error probability of the system 



zx k = Px k y = VsmPx k G Xk x Xk + P Ik w 



(19) 



P Jfc = arg min P err (Pi k , SNR), 



y = VSNRGx + w, 



(20) 
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where G G C mxn is the channel matrix that is known at the receiver, x G A n is the information 
symbol vector, w = Pw G C m , w is the white Gaussian noise vector and P G C mxm is a 
projection matrix that projects a vector in C m to a subspace V C C m . Assume the column 
vectors of G belong to V. Then, the decision made by 

y-VSHRGx 



x = arg mm 



is the ML decision. 



An intuitive explanation for the above lemma is that w is a degenerated white Gaussian noise, 
which can be a white Gaussian noise by removing some extra dimensions. Its detailed proof is 
given in Appendix B. According to Lemma 2, the optimal detection of x Ik from z% k is made 
by 



xj = arg mm 



z Jk - VSNRP Ik G Jk x 



(21) 



which is the PIC group decoding algorithm we propose in this paper. The complexity of the ML 
decoding of the dimension-reduced system in (17) is obviously lower than that of the original 
system in (11). The PIC group decoding algorithm (21) can be viewed as a decomposition 
of the original high-dimensional decoding problem with high complexity into low-dimensional 
decoding problem with relatively low decoding complexity. In the extreme case when all the 
symbols are grouped together, i.e., the problem is not decomposed at all, the PIC group decoding 
is the same as the ML decoding. In another extreme case when each symbol forms a group, i.e., 
the problem is completely decomposed, the PIC group decoding is equivalent to the ZF decoding. 
The detailed description of the connection between these two is given in the following subsection. 

B. Connection Between PIC Group Decoding and ZF Decoding 

In this subsection we discuss the connection between the PIC group decoding algorithm and 
the ZF decoding algorithm. In the case when the decoding problem is completely decomposed, 
i.e., each symbol group contains only one symbol, the PIC group decoding algorithm becomes 
a symbol-by- symbol decoding algorithm. The grouping scheme in this case is 

J = {Jo, Zx, . . . an-i} = {{0}, {1}, ...,{n-l}}, 

where 2j = {i}, < i < n — 1. To simplify the notations, we use i to denote Xj in this special 
case, hence P« = P Xi . All the other notations with J; as the subscript can be similarly defined. 
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Furthermore, we have x { = Xi = x Xi , 9i = Gi = Gj r From the PIC group decoding algorithm 
(21), the ML decision of x k from z k is made by 



x k = arg mm 

x k &A 



= arg mm 



z k -VsMp 



{P k g k ){P k g k T _ 



\Pk9k 



\g k x k + I 



{P k g k ){P k g k ) H 
\\P k g k \\ 2 



z k 



(22) 



(23) 



It is easy to verify that in (23) the last term is orthogonal to the first two terms, thus the above 
decision-making rule can be rewritten as 



x k = arg mm 



= arg mm 



{P f^ P f r z k - Vsmp k9k x k 
\\P k g k \\ 



+ 



i- 



{P k g k ){P k g k ) H 

ID II 2 
W k 9 k\\ 



Z k 



arg mm 

xt&A 



= arg mm 

x^eA 



arg mm 

x k eA 



VSNRP k g k 

(P k g k ) H 



( (P k g k 



VSNR \\P k g k 

(P k g k ) H P k 



\VSMR \\P k g k 

2^k %k 



. 2 %k %k 



Vsm \\P k g k 
{Pk9k) H 



~2V - X k 



■^y - x k 



VSNR\\P k 9k\ 

The last equality is the result of the Hermitian property and the idempotent property of the 
projection matrix, i.e., P" = P k and Pi = P k . Note that 



k gk) 



\\\Pk9k\\' 



gk 



and by applying (16) we have 



(Pk9kf 



g k Pkgk 
9kPkP k g k 

g^ (Pkg l ) 



\Pkg k \ 

( 

\\Pk9k\ 



0. 



gk r k-r k g k 

The above two equations imply that p^jp is the £>th row of (G(/i) H G(h)) _1 G(h) H . Thus, 
in the complete decomposition case, the PIC group decoding algorithm is equivalent to the ZF 
decoding algorithm. 

One negative effect of the interference cancellation procedure is that it may reduce the power 
gain of the symbol x k . Before the interference cancellation, the power gain of x k is \\g k \\ 2 , 
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1 1 2 

while after the interference cancellation, the power gain of x k becomes ||Pfc<7fc|| . Since P k is a 
projection matrix, we always have 

\\P k 9k\\ < \\9k\\- 

The equality holds if and only if g k is orthogonal to the space spanned by g i: % — 0, 1, . . . , k — 
\,k + \,...,n — 1. In the case of OSTBC, the columns of the equivalent channel are orthogonal 
to each other, and therefore, there is no power gain loss during the interference cancellation. 
Hence the performance of the ZF receiver is the same as the ML receiver for OSTBC. For all 
non-orthogonal STBC, an interference cancellation algorithm usually causes a power gain loss 
and therefore performance loss compared to the ML decoding. 

C. PIC-SIC Group Decoding Algorithm 

Notice that in the ZF decoding algorithm, we may use successive interference cancellation 
(SIC) strategy to aid the decoding process. We call the SIC-aided ZF decoding algorithm ZF- 
SIC decoding algorithm [41], [45]. The basic idea of SIC is simple: remove the already-decoded 
symbols from the received signals to reduce the interferences. When the SNR is relatively high, 
the symbol error rate (SER) of the already-decoded symbols is low and there is a considerable 
SER performance gain by using the SIC strategy. The same strategy can also be used to aid the 
PIC group decoding process to improve the SER performance. We call the SIC-aided PIC group 
decoding algorithm PIC-SIC group decoding algorithm. In the PIC group decoding algorithm, 
the decoding order has no effect on the SER performance. For the PIC-SIC group decoding 
algorithm, different decoding orders will result in different SER performances. We can obtain 
a better performance by choosing a proper decoding order. One simple way to determine the 
decoding order is to use maximum SNR as the criterion to arrange the decoding order, similar 
to BLAST ordered SIC algorithm. Suppose the ordered symbol sets are as follows, 

The ordered PIC-SIC group decoding algorithm is then: 

1) Decode the first set of symbols xj iQ using the PIC group decoding algorithm (21); 

2) Let k — 0, y — y, where y is defined as in (11); 
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3) Remove the components of the already-detected symbol set x Iik from (11), 

N-l 

y k+l 4 y k - VSNRG Iik x Iik = VSNR ^ G Xi . x Xi . + w. (25) 

j=k+l 

4) Decode in (25) using the PIC group decoding algorithm; 

5) If k < N — 1, then set k :— k + 1, go to Step 3; otherwise stop the algorithm. 

Remark 1. For the PIC group decoding algorithm, the equivalent channel matrix G(h) G C mxn 
must satisfy the condition Yj k C C m , otherwise = 0, i.e., there is no information left in z Xk 
about xi k . This requirement is generally weaker than that of the ZF decoding, which requires 
that m > n. For example, consider an uncoded MIMO system with 5 transmit antennas and 4 
receive antennas. In this case, the ZF receiver can not decode the received signals, while the 
PIC group decoding with the grouping scheme J = {J = {0, l,2},Ji = {3,4}} can do the 
decoding. 

Remark 2. For the PIC-SIC group decoding algorithm, we require that at each decoding stage, 
Yj k C C m . This requirement is even weaker than that of the PIC group decoding. Since we 
remove the interferences from the already-decoded symbols, the subspace Yj k shrinks each time 
when we finish decoding one symbol group. Consider the uncoded MIMO system in Remark 
1. Let X = {X = {0, 1,2,3}, Xi = {4}} be the grouping scheme. Then it is not possible to 
decode the second group symbol x 4 with the PIC group decoding algorithm, because after we 
remove the interferences from xo, x\, X2, xs, there is nothing left (zj 1 = 0) due to the lack of 
dimensionality. However, we can decode x A with the PIC-SIC group decoding. 



D. Examples 

Next we give some examples to illustrate the PIC group decoding algorithm. 
I) Example 1: Consider the Alamouti code with one receive antenna. The equivalent channel 
matrix can be written as 

G = \g ,gi], 



where 



go 



1 

71 



h 
hi 



9i 



1 



hi 



— h* 
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The grouping scheme is J = {I ,Ii} = {{0},{1}}. By a direct computation, we get the 
projection matrix P as follows, 



1 



\ho\ 2 + \hi\ 2 



\h \ h hi 
h*hl Ihf 



Then, the optimal detection of x is 



x = arg mm 



arg mm 

XqGA 



P y - VSNRPoPo^o 



SNR 



\h\ 2 + Ihf) x 



which is the same as the optimal detection formula derived in [1]. Similarly, the optimal detection 
for x\ is 



X\ = arg mm 

rriS.4 



Ky - h yi - ^-y- {\ h o\ 2 + l^il 2 ) x i 



2) Example 2: Consider the quasi-orthogonal STBC proposed in [40]. The code has the 
following form, 

X —X\ X 2 —Xl 
0b\ Xq X2 
Xi —X\ Xq —x\ 
0C\ ^0 

Suppose we use one receive antenna. The equivalent channel matrix G(h) can be written as 



G = [90,91,92,93], 



where 



h 








~h 2 ~ 




" h ' 


hi 


1 


-K 


1 


h% 


1 






,92 = ~ 




' 93 ~ 2 




h 2 




h 




h 3 




h x 


K 




-h* 2 _ 




K 




-K. 



9o = - 



Let J = {0, 2} and X 1 = {1, 3}. Then, the optimal detection of Xj is 

x Io = arg min P I() y - VSNRPj Gj Xz 

It is easy to verify that 



(26) 



9o-Lgi, 9o^9z, 92^9i, 92-L93, 
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so Y Xo ±Y Xl . This fact implies that P Xl G Xo = 0. The decoding rule in (26) can be simplified as 
i Xo =arg min ( P Xo y - VSNRP X() G Xo x X() + WP^yW ) 

x Xq €A 2 V / 

P Xl y- VSMRPt.G^x^ 



(jp Xo y - VSNRP Xq G Xo x Xo 

y - VSURG Xo x Xo 
The decoding rule of x Xl can be similarly derived, 



arg mm 



arg mm 



x Xl 



arg mm 



y-VSMRG Xl x Xl 



From the above equations, we can see that if the groups are orthogonal to each other, then the 
decomposition of the system is easy: just to pick up the column vectors corresponding to a 
group in G(h) and get a new equivalent channel matrix, then use this new channel matrix and 
the received signal y to do the ML decoding. In this case, no linear filtering is needed in the 
PIC group decoding and the ML decoding and the PIC group decoding are the same. 
3) Example 3: Consider the 3 by 8 overlapped Alamouti code in [32], 

x% Xi x* A xi 



X = 



X n 



Xo —x\ x 2 —x% x 4 —xt 



X\ X X3 

An equivalent channel matrix can be written as 



X2 



XI 



X4 



X 5 






G 



9o 9i 92 9s 9a 95 





K 



















h 


h 2 
















K 


-hi 


K 











1 





h 


hi 


h 2 
















h* 


-hi 


K 
















h 


h 


h 2 
















h* 2 


-hi 



















h 



Let the grouping scheme be J = {J = {0, 2, 4} , Ji = {1, 3, 5}}. It is easy to verify that 



gi±9j, % = 0,2,4, j = 1,3,5. 
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Similar to Example 2, the system can be decomposed into two systems without performance 
degrading. For general overlapped Alamouti codes, if we choose the grouping scheme as 

J = {J = {0, 2, 4, . . . , n — 2} ,2i = {1, 3, 5, . . . , n — 1}} , n even, 

J= {J = {0,2,4, ...,n- l},Ji = {l,3,5,...,n-2}},ra odd. 
then the system can always be decomposed into two systems without performance degrading. 
This property is the reason why overlapped Alamouti codes perform better than Toeplitz codes, 
since the interference comes from only half of the symbols. 

IV. Full Diversity Criterion For PIC and PIC-SIC Group Decodings 

In this section, we propose a design criterion for linear dispersion STBC to achieve full 
diversity with the PIC and the PIC-SIC group decodings. 

A. Notations and Definitions 

For convenience, let us first introduce some notations and definitions. Let S be a subset of 
the complex number field C, we define the difference set AS as follows, 

AS = {a — a, | a, a G S}. 

We also introduce the following definition, which can be viewed as an extension of the conven- 
tional linear independence concept. 

Definition 1. Let S be a subset of C and V; L G C m , i — 0, 1, . . . , n — 1, be n complex vectors. 
Vectors v ,Vi, . . . , v n -i are called linearly dependent over S if there exist do, ai, . . . , a n _i G S 
so that 

a v + ai«i H h a n _i« n _i = 0, (27) 

where a , a±, . . . , a n _i are not all zero; otherwise, vectors v , v±, . . . , v n _i are called linear 
independent over S. 

For diversity order, the following definition is known. 

Definition 2. Consider a communication system as described in (9). The system is said to achieve 
diversity order m if the symbol error rate Pser(SNR) decays as the inverse of the m-th power 
o/SNR, i.e., 

Pser(SNR) <c-SNR-"\ 
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where c> is a constant independent o/SNR. 

The conventional concepts of linear independence and orthogonality are defined among vectors. 
Next, we define them among vector groups. 

Definition 3. Let V = {vi e C n , i — 0, 1, 2, ... ,k — 1} be a set of vectors. Vector v fc is said to 

be independent of V if for any e C, % — 0, 1, . . . , k — 1, 

fc-i 

diVi ^ 0. 

Vector v k is said to be orthogonal to V if v k -Lv iy % — 0, 1, . . . , k — 1. 

Definition 4. Le? Vo, Vi, . . . , V n _i, V n be n + 1 groups of vectors. Vector group V n is said to be 
independent of Vo, Vi, . . . , V n -i if every vector in V n is independent of\J™~Q Vi. Vector group V n 
is said to be orthogonal to Vo, Vi, . . . , V„_i if every vector in V n is orthogonal to \XZq Vector 
groups Vo, Vi, . . . , V n are said to be linearly independent if for < k < n, Vk is independent 
of the remaining vector groups Vo, Vi, . . . , Vk-i, Vk+i, • • • , V„. Vector groups Vo, Vi, . . . , V n are 
said to be orthogonal if for < k < n, Vk is orthogonal to the remaining vector groups 
Vo, Vi, . . . , Vfc_i, Vfe+i, . . . , V„. 

In the remaining of this paper, for convenience, a matrix notation such as G is also used to 
denote the vector group that is composed of all the column vectors of G. 

B. Design Criterion of STBC with the PIC Group Decoding 

In this subsection, we derive a design criterion of codes decoded with the PIC group decoding. 
First we introduce the following lemma, which gives a sufficient condition to achieve full diversity 
for the general channel model in (9) with the ML receiver. 

Lemma 3. Consider a communication system modeled as in (9). A is a signal constellation 

used in the system. If the channel matrix G(h) satisfies the following inequality, 

i-i 

\\G(h)Ax\\ 2 > c ■ M 2 || Aa;|| 2 , VAx e AA n , 

i=0 

for some positive constant c, then the system achieves diversity order I with the ML receiver. 
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The proof of this lemma is simply a matter of computation of some integrals, which is quite 
similar to those derivations in [13], [37]. A detailed proof is given in Appendix C. To understand 
the meaning of Lemma 3, let us first define the power gain for the channel model in (9). 

Definition 5. Consider the communication system modeled as in (9). A is a signal constellation 

used in the system. The power gain of the system is defined as 

. \\G(h)Axf 

P = mm k — . 

AxeAA n ||Ax|| 

If the power gain P satisfies the following inequality, 

i-i 

i=0 

for some positive constant c, then we say that the system achieves power gain order I. 

From Lemma 3, one can see that the diversity order is ensured by the above power gain order 
and it can be further interpreted as follows. Suppose that there are two different symbol vectors 
x , X\ E A n . The distance between the two symbol vectors is ||Ax|| = ||cco — cci||. Assume there 
is no noise in the channel, i.e., w = 0, then after the symbol vectors pass through the channel, 
we get G(h)x and G(h)x\. Now the distance between received signals G(h)x and G{h)x\ is 
||Gr(h)Aa;||, which is greater than \[P ||Ax||, i.e., the channel "expanded" the distance between 
x and a; x by a factor of at least \[P. The expansion factor \f~P determines the diversity order 
that can be achieved. Lemma 3 tells us that if the expansion factor \J~P of the symbol vector is 
greater than ( c • J2i=o 1^1 ) f° r some c > 0, then diversity order / can be achieved. Note that 
the power gain order can be viewed as a count of how many path gains summed up in P. We 
can rephrase Lemma 3 simply as: if the power gain is a sum of I path gains, then the diversity 
order of the communication system in (9) is I. 

Next, we present the main result of this paper, which characterizes the power gain order of a 
linear dispersion STBC decoded with the PIC and the PIC-SIC group decoding algorithms. 

Theorem 1 (Main Theorem). Let X be a linear dispersion STBC. There are rit transmit and 
n r receive antennas. The channel matrix is H e C" rXnt . The received signal is decoded using 
the PIC group decoding with a grouping scheme {I ,I 1 , . . . ^Xjy^}. The equivalent channel 
is G(h), where h = vec(H) = {ho, hi, . . . , h nr . nt -i} G C nr ' nt . Then, each of the following 
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(28) 



dimension-reduced systems (i.e., the STBC with the PIC group decoding), 

z Ik =P Ik G Ik x Ik +P Ik w, k = 0,l,..-,N-l, 

has power gain order n r ■ n t if and only if the following two conditions are satisfied: 

• for any two different codewords X,X e X, AX = X — X has the full rank property, i.e., 
the code X achieves full diversity with the ML receiver; 

• Gj Q ,Gx 1 , . . . ,Gj N _ 1 defined in (10) from G = G(h) are linearly independent vector groups 
as long as h ^ 0. 

When the received signals are decoded using the PIC-SIC group decoding with the ordering 
(24), each dimension-reduced system derived during the decoding process (i.e., the STBC with 
the PIC-SIC group decoding) has power gain order n r ■ n t if and only if 

• for any two different codewords X,X E X, AX = X — X has the full rank property, i.e., 
the code X achieves full diversity with the ML receiver; 

• at each decoding stage, Gj ik , which corresponds to the current to-be decoded symbol group 
x ik , and [Gj ik+l , ■ ■ ■ iGx iN J are linearly independent vector groups as long as h ^ 0. 



With the above theorem and the preceding discussions on the relationship between diversity 
order and power gain order, the two conditions in the above theorem provide a criterion for a 
linear dispersion code to achieve full diversity with the PIC group decoding. 

Let us see an example to use the above main theorem. Consider the rotated quasi-orthogonal 
scheme proposed in [35] for a QAM signal constellation, where the code X has the following 
structure, 



X = 



X 




ax 2 — a*xl 


Xi 


Xq 


axs a*x\ 


ax 2 


—a*xl 


Xq —xl 


axs 




Xl Xq 



a 



cxp 



T 



(29) 



Suppose we use one receive antenna, the column vectors of the equivalent channel G are as 
follows, 



1 

9o = ~ 



h 




" hi ' 




ah 2 




ahs 


hi 


1 


-h* 


1 


ahl 


1 


-ahl 












h 2 




h 3 




aho 




ahi 


K 




-hi 




ahl 




—ah*Q 



(30) 
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It has also been proved in [35] that this code achieves full diversity with the ML receiver, hence 
the first condition is satisfied. Let the grouping scheme be {J = {0, 2}, J x = {1, 3}}, then Gj Q 
and Gj L are linearly independent. Thus, both conditions are satisfied. Note that the two groups 
are actually orthogonal, which means that every vector in Gj Q is orthogonal to Gj 1 and vice 
versa. Hence after the interference cancellation, there is no power gain loss. In this case, the 
PIC group decoding is exactly the same as the ML receiver. 

C. Proof of the Main Theorem 

In order to prove the main theorem, let us first introduce the following lemma. 

Lemma 4. Consider a communication system modeled as in (9). A is a signal constellation 
used in the system. If the equivalent channel matrix G{h) satisfies the following two conditions: 
• scaling invariance: 



• the column vectors of G{h) are linearly independent over AAfor any ^ h G C l , 
then the system has power gain order I and thus achieves diversity order I with the ML receiver. 

A proof is given in Appendix D. Note that if each entry of G(h) is a linear combination of 
h ,hi, . . . ,hi-\ and h$, h\, . . . , h*_ v then the scaling invariance (31) always holds. So we have 
the following corollary. 

Corollary 1. Consider a communication system modeled as in (9). Each entry of G(h) is a 
linear combination of ho, hi, ... , hi_i and h$, h\, . . . , h*_ v A is a signal constellation used in 
the system. If the column vectors of G{h) are linearly independent over AAfor any ^ h = 
[ho, hi, ... , /^-i] T G C l , then the system has power gain order I thus achieves diversity order I 
with the ML receiver. 

One may wonder for linear dispersion STBC, whether the above condition is an equivalent 
condition of the full rank criterion. The following theorem gives a positive answer to this question. 

Theorem 2. Let X be a linear dispersion STBC. Let A be a signal constellation for the coding 
scheme X. Let G(h) be the equivalent channel of X and h and h ^ 0. Then X has the full 
rank property if and only if the column vectors of G{h) are linearly independent over AA. 




(3D 
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Its proof is in Appendix E. 

Now we are ready to prove the main theorem. The main idea is to prove that the dimension- 
reduced systems in (28) satisfy the two conditions in Lemma 4. 

1 ) Sufficiency part: First we prove that the two conditions in the main theorem are sufficient 
conditions for codes to achieve the full power gain with the PIC group decoding algorithm. 
According to Theorem 2, the first condition is equivalent to that the column vectors of G(h) are 
linearly independent over AA. This further implies that the column vectors of G Xk are linearly 
independent over AA, i.e., for any a , a 1: . . . , a„ fe _i G AA, aj, j — 0, 1, . . . , n k — 1, not all zero, 
we have 

n k -l 
3=0 

Since G Xo , G Xl , • • • , G Xn _ 1 are linearly independent, the column vectors g ik , j — 0, 1, . . . , n k — 1, 
in Gi k do not belong 1 to the vector space Yj k defined in (12). From (32) and the fact that 
9i kJ & we have 

a j9ik,j ■ 

3=0 J 3=0 

By applying the above inequality, we get the following inequality, 

n k -l (n k -\ 

^ ajPi k 9i kJ = Pj k a ^i k , : 

3=0 \ j=0 

n k -l 

= (Im-Ql k ) J2 Wikj + °> 
3=0 

i.e., the column vectors of P Ik Gj k are also linearly independent over AA. 

Now we prove that P Ik Gj k satisfies the scaling invariance (31) in Lemma 4. Since both P Ik 

and Gx k are determined by the parameter vector h, for a clear exposition, we temporarily use 

P Ik (h) to denote P Xk and use Gj k (h) to denote Gj k . Then we have 

1 „ , 



M P Xk {h)G Xk {h) = P Xk {h) [jj^Gi k (h) 
= P Xk {h)G X] 1 h 



\h\ 



h \ „ ( h 



x k I 777777 I Gl k 



h\\J k \\\h 

'Here the linear independence over the whole complex field of the vector sets is needed/used and the linear independence 
over A.4 is not sufficient. 
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where the second equality holds since the entries in Gj k are all linear combinations ofh ,hi,..., h Ur . nt 
and h* Q ,h\,..., h* nr . nt , and the last equality holds since 

P Xk {h)=P Xk < h 



\h\ 

which is a direct result from the definition of Q h in (14) and the fact that Pj k = I m — Qx k - 

Thus, the two conditions in Lemma 4 are all satisfied and therefore for any k, the dimension- 
reduced system 

z Xk = Vsm(P Ik G Ik ) x Ik +p Ik w, 

has power gain order n r ■ n t . 

Now let us consider the case when the received signals are decoded with the PIC-SIC group 
decoding. We use the conventional assumption that the previous decoded symbols are correct. 
Thus, there is no error introduced when we use these decoded symbols to reduce the interferences 
from the received signals. Under this assumption, the PIC-SIC group decoding algorithm is 
always better than the PIC group decoding algorithm. Thus, the two conditions are sufficient for 
the PIC-SIC case. 

2) Necessity part: We next prove that these two conditions are also necessary conditions. 
If G Tk and G Xk = [G Xo , ■ ■ ■ ,G Xk _^G Xk+1 , . . . ,G Xjv _J are not linearly independent, i.e., there 
exists a column vector in Gj k such that this vector belongs to the subspace Yj k . Without loss 
of generality, we assume this vector is g ik0 . In this case, we have 



Pr,G 



0? Plk9ik,iiPlk9ik,2i ■ ■ ■ > Plk9ik,n k -i j • 

Take Ax Jk = [a, 0, 0, ... , 0] T G AA Hk , where a G AA, a ^ 0, then we have \\P Ik G Ik Ax Ik || = 
0, which contradicts with the condition that the systems in (28) have power gain order n r • n t . 
Thus, we must have that Gj k is linearly independent of Gj k . Since k is an arbitrary integer 
number in [0, N — 1], Gj ,Gj 1 , . . . , Gj N _ 1 are linearly independent. This proves that the second 
condition in the main theorem must hold. 

Let Ax 7^ G AA n and Ax Ik , k — 0, 1, . . . , N — 1, be the corresponding sub-vectors of Ax 
to the grouping scheme. Thus, there is at least one Ax Ik ^ 0. Without loss of generality, we 
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2 

N-1 2 

i=i 

Since P X{) G X{) Ax Xq e V^ and Q Xo G Xo Ax Xo + ^-1^ ' G Xk Ax Xk e Y Xq , we have 

AT-l 2 

Q Xo G Xo Ax Xo + ^ G Jfc A£c Jfc 
i=i 

(rir-nt—l \ 
|^| 2 J ||Aa; Jo || 2 > 0, ft^O. 

Using Theorem 2, the first condition in the theorem is proved. 

In the case that the received signals are decoded with the PIC-SIC group decoding, we 
assume the decoding order is T io ,l ii: . . . ,X ijv l . Similar to the above argument, we must have 
that Gi H) is linearly independent of Gx h , G Xi2 , • • • , G XiN _ L ; G Xii is linearly independent of 
G x ,Gt , . . . , G x ; Gt is linearly independent of G x ,G X , . . . , G x etc. So we have 
that G Xi , G Xii , . . . , G XiN are linearly independent. The proof of the first condition to be 
necessary is the same as the PIC case. This completes our proof of the main theorem. 



assume Ax Xo ^ 0. Then, 



\\G(h)Ax\\ 2 = 



N-1 



G Xo Ax Xo + J2 G ^ Ax 



i=l 



P-t„G-t„Ax-t„ +QtGt,Ax 



G(h)Axf = \\P Xo G Xo Ax Xo f + 



D. Connection with the Full Rank Criterion and the Shang-Xia Criterion 

In the case when there is only one group, then the PIC group decoding algorithm becomes 
the ML decoding. In this case the second condition can always be satisfied. Thus, our proposed 
design criterion in Theorem 1 is equivalent to that of [13], [37]. 

We now consider the symbol-by-symbol grouping case of the PIC group decoding algorithm, 
which is equivalent to the ZF decoding algorithm. In this case when each group contains only 
one symbol, the second condition can be rephrased as: G{h) is a column full rank matrix for 
hjL 0. 

Corollary 2. In the case of symbol-by-symbol PIC group decoding, i.e., each group only contains 
one symbol, the design criterion in the main theorem is equivalent to the Shang-Xia criterion 
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proposed in [32], i.e., 

det (G(h) H G(h)) > c \\h\\ 2n , h e C l , 
where c is a constant independent of the channel h. 

Proof: Since we have that G(h) is full column rank for h ^ 0, the following inequality 
must hold, 

det (G(h) H G{h)) > 0, h ^ 0, 

Let us restrict the parameter h to the unit sphere, i.e., \\h\\ = 1. Note that the unit sphere is 
a compact set, det (G(h) H G(h)) is a continuous function of h. There must exist a positive 
constant c > such at 

det (G{h) H G{h)) >c, h^O, 

as what is used in [48]. Generally, for h e \ {0}, we have that 

h \ H „ / h 



del l G (j^J I >r - (33) 

Since the entries of G(h) are linear combinations of h 0: hi, . . . , hi-i and h* , hi, . . . , h*_ v in- 
equality (33) can be rewritten as 



det 

Thus, 



( 1 ^ 2 - I > c, /i ^ 0. (34) 



det (G(h) H G(h)) > c \\h\\ 2n , h e C, (35) 

which is the Shang-Xia condition given in [32]. This proves that the criterion in Theorem 1 
implies the Shang-Xia criterion in the case when all symbols are in separate groups, i.e., the ZF 
receiver. 

Since the criterion in Theorem 1 is necessary and sufficient, it can be derived from the Shang- 
Xia criterion too. In other words, the criterion in Theorem 1 is equivalent to the Shang-Xia 
criterion in the case when the ZF receiver is used. ■ 
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E. Some Discussions 

From Theorem 1 and Theorem 2, it is interesting to see that for a linear dispersion STBC 
(complex conjugates of symbols may be embedded) to achieve full diversity: (i) the weakest 
criterion is that the column vectors of the equivalent channel matrix are linearly independent 
over a signal constellation A when the ML receiver is used, which is equivalent to the code full 
rank criterion known in the literature; (ii) the strongest criterion (in the sense of the simplest 
complex- symbol- wise decoding) is that the column vectors of the equivalent channel matrix are 
linearly independent over the whole complex field when the ZF receiver is used, which is, in 
fact, weaker than the orthogonality in the OSTBC case that is not necessary for achieving full 
diversity with a linear receiver. In the case of the weakest criterion but the optimal and the most 
complicated receiver, i.e., ML receiver, the symbol rate can be n t for n t transmit antennas. In the 
case of the strongest criterion but the simplest receiver, i.e., linear receiver, the symbol rate can 
not be above 1 [32]. Note that the rates of OSTBC approaches to 1/2 as the number of transmit 
antennas goes to infinity and are upper bounded by 3/4 for more than 2 transmit antennas [44]. 
By increasing the decoding complexity and improving a receiver as increasing the group sizes 
in our proposed PIC group decoding, the criterion to achieve full diversity becomes weaker. The 
criterion for the PIC group decoding serves as a bridge between the strongest and the weakest 
criteria for the ZF and the ML receivers, respectively, and the corresponding symbol rates are 
expected between 1 and n t . The examples to be presented later in Section VI are some simple 
examples to show this rate-complexity tradeoff. 

V. Asymptotic Optimal Group Decoding 

In this section, we propose an asymptotic optimal (AO) group decoding algorithm, which can 
be viewed as an intermediate decoding between the ML decoding and the MMSE decoding 
algorithms [3], [41]. 

A. Asymptotic Optimal Group Decoding Algorithm 

Consider the channel model in (9). Suppose the signals are decoded using a group decoding 
algorithm, and the grouping scheme is T = {I ,li,l 2 , ■ ■ ■ ,Zn-i}- Assume the symbols are 
taken from a signal constellation A according to the uniform distribution. The optimal way to 
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decode x Xk from the received signals is to find x Xk e A nk such that 

x Xk = arg max : P (y \ x Xk ) . 
To derive the decoding rule, let us first write (11) in the following form, 



y = VSNRG Xk x Xk + VSNR ^ G Xi x Xi +w. (36) 



0<i<JV-l 

i^k 



Note that except for the symbol group x Xk , all the other symbols can be viewed as noises that 
interfere with x Xk . Define the noise term w Xk as 

w Xk = Vsm G ii x ii + w 

()<i<N-l 

i^k (37) 

= VSNRj^ QiXi + w. 

Then, we can write (36) as 

y = VSMRG Ik x Ik + w Ik . (38) 

The optimal decoding of x Ik from the received signal vector y depends on the distribution of 
the noise Wj k , which is difficult to analyze in general. To simplify the discussion, we assume 
that the noise w Ik is Gaussian. This assumption is asymptotically true when the number of the 
interference symbols is large. Similar assumption has been used in [24]. We call the optimal 
result derived under this assumption asymptotically optimal. 

Under the above assumption, the probability density function P (y \ xj k ) can be explicitly 
expressed and the optimal decoding rule can be easily derived. First let us compute the covariance 
matrix of the noise vector Wj k : 

K Xk =£{w Xk wl} 
Hence the probability density function P (y \ x Xk ) is as follows, 

p (y I *x fc ) = r^j^P (" {v ~ y/smch h x Xk y K x ] (y - VsmG Xk x Xk )^ . 

For the above equation, we can see that maximizing P (y | x Xk ) is equivalent to minimizing 
(y - VsmG Xk x Xk ) H K-] (y - VsmG Xk x Xk ) = \k~} (y - VSMRG Xk x Xk ) f , 
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where K x * is the square root of the matrix K x \ So the asymptotic optimal decoding rule is 



x%, = arg max 



K~l (y - VSMRG Ik x Ik ) 



(39) 



When J = {J } = {{0,1, 2, — 1}}, we only have one symbol group Xj , which contains 
all the symbols. The variance of the noise is K Io = I m . In this case, the above decoding rule 
can be simplified as 



6 X <=A™ 



y - V$MG(h)x 



which is the ML decoding. 



B. Connection with the MMSE Decoding 

Now let us consider the symbol-by-symbol case of the AO group decoding algorithm. In this 
case, J = 1q,1i, . . . ,T n -i = {{0}, {1}, • • • , {n — 1}}. In the following discussion, we use the 
simplified notation convention introduced in III-B. Thus, we use K k instead of K Xk to denote 



So the decoding rule is 



arg max 



= arg max 

xt&A 



Kj (y-VSHRg k x^ 
9 H k K k l y 



The term 



Vsmg H k K k9k 

c m d ii,- is m e unbiased estimator of x k . In this case, the AO group decoding 

VoNKg k K k g k 

algorithm is equivalent to the unbiased MMSE decoding [41]. By a proper scaling, we can get 
the MMSE estimator from 7==^ J — [41]. Although the MMSE estimator is optimal with 

VSNRg k K k g k 

respect to the mean squared error, it may not be optimal with respect to the symbol error 
probability and the unbiased MMSE may have a better performance [3]. 



C. AO-SIC Group Decoding Algorithm 

Similar to the PIC case, we can use the SIC technique to aid the AO group decoding process. 
The decoding order can be simply determined according to the maximum SINR criterion, which 
is similar to the PIC-SIC case. Suppose the ordered symbol sets are 

xi io ,xi ii ,...,x Tin _ i . (40) 
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The following is the AO-SIC group decoding algorithm: 

1) Decode the first set of symbols x% i using the AO group decoding algorithm; 

2) Let k = 0, y = y, where y is defined as in (11); 

3) Remove the components of the already-detected symbol set x Xik from the (11), 

N-l 

y k+1 ^y k - VsmG Ik xj. = Vsm V G i x Xi .+w- (41) 

j=k+l 

4) Decode in (41) using the AO group decoding algorithm; 

5) If k < N — 1, then set k :— k + 1, go to Step 3; otherwise stop the algorithm. 

VI. Design Examples 

In this section, we present two design examples that achieve the full diversity conditions with 
pair-by-pair PIC group decoding. 

A. Example 1 

Consider a code for 2 transmit antennas with 3 time slots of the following form, 

CXq + SXi CX 2 + sx% 

X = , (42) 

— SXq + CXi —sx 2 + cx 3 

where c = cos 9,s = sin 9,8 £ [0, 2ir). The symbol rate of this code is |. 

In the following, we show that this code can be decoded with pair-by-pair PIC group decoding. 

Theorem 3. Let A C be a QAM signal constellation. LetX = {{0, 1} , {2, 3}} be a grouping 
scheme for the PIC group decoding algorithm. If tan £ Q, then code X in (42) achieves full 
diversity using the PIC group decoding algorithm with the grouping scheme X. 

Proof: Firstly, we prove that the code given in (42) has full rank property for any A C Z[i]. 
In order to prove this, we only need to prove that for any x { e = 0, 1, 2, 3, which satisfies 

that Xi not all equal to zero, X is full rank. Since tan 9 £ Q, equation cx + sx 1 = holds 
for x ,xx E AA if and only if x = x 1 = 0. Similarly, equation — sx 2 + cx 3 = holds for 
x 2 ,x 3 E AA if and only if x 2 = x 3 = 0. Next, we discuss two different cases. 

i). When x and x x are not all equal to zero and x 2 and x 3 not all equal to zero, then 
cx + SXi 7^ 0, — sx 2 + cx s ^ 0. In this case, X is full rank; 
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ii). When x and x 1 are not all equal to zero but x 2 = x 3 = 0, then 

cxo + sxi 

— sx + cxi 
is full rank; similarly, in the case when x 2 and x 3 are not all equal to zero but x = 
Xi — 0, X is full rank too. 
So the code in (42) has full rank property. 

Next, we prove that the code X satisfies the second condition in the main theorem. Suppose 
there is only one receive antenna, the equivalent channel can be written as 

cho sh 
—shi chi ch sh 
—shi chi 

obviously g and g 1 can not be expressed as a linear combination of g 2 ,gs, and vice versa, 
when h ^ 0. Thus, [go,gi] and [#2, #3] are linearly independent, when h ^ 0. According to the 
main theorem, the code achieves full diversity with the PIC decoding algorithm provided that 
the grouping scheme is J = {{0, 1} , {2, 3}}. ■ 



[90,91,92,93] = 



B. Example 2 

The following code is designed for 4 transmit antennas with 6 time slots. 

CX + SX 1 —CX\ — SX\ CX A + SX 5 — CXg — SXy 

CXq + SXi —CX* 2 — SX3 CX4 + SX 5 —CX% + SXj 



X = 











CX2 + SX3 CXq + SX\ CXq + SX-j CX\ + SX\ 











(43) 



CX 2 + SX3 CXq + SX\ CXq + SX 7 CX\ + Sx\ 

where c = cos 6, s = sin 9, 9 G [0, 2%). It can be proved that this code satisfies the two conditions 
given in the main theorem if the grouping scheme is X = {{0, 1} , {2, 3} , {4, 5} , {6, 7}}. 

Theorem 4. Let A C Z[i] be a QAM signal constellation. Let! = {{0, 1} , {2, 3} , {4, 5} , {6, 7}} 
be a grouping scheme for the PIC group algorithm. If tan 9 £ Q, then the code X in (43) achieves 
full diversity using the PIC group decoding algorithm with the grouping scheme X. 

Proof: The proof is similar to the 2-transmit-antenna case. First we prove that this code 
satisfies the full rank criterion. This is easy to verify just by looking into the code case by case 
as the previous proof. 
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Next we prove that the second condition in the main theorem also holds. In the case when 
there is only one receive antenna, the equivalent channel matrix can be written as follows, 

G = [go, 9i,92, Ss, 94, 95, 9e, 9t\ 



ch 


sho 


chi 


sh\ 














chl 


sh\ 


-chl 


-sh* 














—sh 2 


ch 2 


-sh 3 


ch 3 


ch 


sh 


ch\ 


chi 


—sh% 


C K 


sh* 2 


-ch* 2 


ch{ 


sh\ 


-chl 


-sh* 














-sh 2 


ch 2 


-sh 3 


ch 3 














-sh* 3 


chl 


sh* 2 


-ch\ 



(44) 



Let h 0. We can see that [g , gri] is orthogonal to [g 2 , g 3 ]. Vector group [g , gi] is also linearly 
independent of #4, #5, #6, fl^- Thus, [go,gi] can not be expressed by any linear combination of 
the rest column vectors in G. A similar discussion can be applied to the other vector groups. 
Therefore, the second condition in the main theorem also holds. This completes the proof. ■ 

VII. Simulation 

In this section, we present some simulation results. In all the simulations, the channel is 
assumed quasi-static Rayleigh flat fading. First we choose the rotation angle 9 for the codes in 
(42) and (43) by numerically estimating the coding gains of the codes for a series of values of 
9. Here the coding gain C g is defined as 



C g = arg max <^ C g \ 

■* blockerr 

(SNR) < —SNR 



-D a 



(45) 



where D g is the diversity order. We use Monte Carlo simulations to estimate the coding gains for 
different 0's. As we can see from Fig. 1, the peak value of C g is reached at two points: 9 = 0.55 
and 9 = 1.02. Interestingly enough, these two values of 9 are very close to \ arctan(2) and 
\ — \ arctan(2), which maximize the coding gain of the 2 x 2 diagonal code [41]. An intuitive 
explanation is that the code in (42) can be viewed as two diagonal codes stacked together and 
even after the interference cancellation, 9 — \ arctan(2) and 9 — | — \ arctan(2) still maximize 
the coding gain. 

In Fig 2, we compare our new code in (42) for 2 transmit antennas with the Alamouti code for 
2 transmit antennas, Golden code [2] for 2 transmit antennas, and the QOSTBC for 4 transmit 
antennas with the optimal rotation [35] and the symbol-pairwise decoding. The number of receive 
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antennas is 3. The constellation for our new code is 64 QAM. The constellation for the Alamouti 
code and for the QOSTBC is 256 QAM and for Golden code is 16 QAM. Thus, in all schemes, 
the transmission rate is 8 bits/sec/Hz. The PIC group decoding with group size 2, i.e., symbol- 
pair-wise decoding, is used for our new code and Golden code decodings. Fig. 2 shows that 
our proposed coding scheme is about 1.5 dB better than the Alamouti code and outperforms the 
QOSTBC before SNR reaches 27 dB, while the PIC group decoding has higher complexity than 
the symbol-wise decoding for the Alamouti code but has similar complexity as the QOSTBC 
ML symbol-pair- wise decoding. For Golden code, since it is only full rank (although with non- 
vanishing determinant), it does not achieve full diversity when the PIC group decoding is used, 
which can be seen from Fig. 2. 

For the 4 by 6 code in (43) for 4 transmit antennas, we compare it with the QOSTBC with the 
optimal rotation [35] and Nguyen-Choi code [28]. The number of receive antennas is also 3 for 
all these codes. Our new coding scheme uses a 64-QAM constellation and the QOSTBC uses a 
256-QAM constellation so that the bit rates for both schemes are 8 bits/sec/Hz. For Nguyen-Choi 
code, the constellation is 32-QAM (it is obtained by deleting the four corner points from the 
6 by 6 square QAM as what is commonly used) so that the bit rate is 7.5 bits/sec/Hz. We use 
the PIC and PIC-SIC group decodings for the new code, respectively, and the ML decoding for 
the QOSTBC, and the PIC-SIC group decoding for Nguyen-Choi code. In this case, all these 
decodings are symbol-pair-wise based. The simulation results show that our new code with the 
PIC group decoding and the PIC-SIC group decoding is 2.3 dB and 2.8 dB better than the 
QOSTBC, respectively. From Fig. 3, one can see that our new code does achieve full diversity 
as compared with the full diversity QOSTBC and the diversity gain of Nguyen-Choi code is 
smaller than that of our new code. 

Note that the overlapped Alamouti codes proposed in [32], [33] with linear receivers may 
outperform all the other existing codes with linear receivers in the literature including perfect 
codes [11], [29] but do not outperform OSTBC or QOSTBC for the number of transmit antennas 
below 5. 

VIII. Conclusion 

In this paper, we first proposed a PIC group decoding algorithm and an AO group decoding 
algorithm that fill the gaps between the ML decoding algorithm and the symbol-by-symbol linear 
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decoding algorithms namely the ZF and the MMSE decoding algorithms, respectively. We also 
studied their corresponding SIC-aided decoding algorithms: the PIC-SIC and the AO-SIC group 
decoding algorithms. We then derived a design criterion for codes to achieve full diversity when 
they are decoded with the PIC, AO, PIC-SIC and AO-SIC group decoding algorithms. The new 
derived criterion is a group independence criterion for an equivalent channel matrix and fills 
the gap between the loosest full rank criterion for the ML receiver and the strongest linear 
independence criterion of the equivalent channel matrix for linear receivers. Note that the full 
rank criterion is equivalent to the loosest linear independence for the column vectors of the 
equivalent channel matrix over a difference set of a finite signal constellation while the strongest 
linear independence criterion is the linear independence for the column vectors of the equivalent 
channel matrix over the whole complex field. The relaxed condition in the new design criterion 
for STBC to achieve full diversity with the PIC group decoding provides an STBC rate bridge 
between n t and I, where rate n t is the full symbol rate for the ML receiver and rate 1 is the 
symbol rate upper bound for linear receivers. Thus, it provides a trade-off between decoding 
complexity and symbol rate when full diversity is required. We finally presented two design 
examples for 2 and 4 transmit antennas of rate 4/3 that satisfy the new design criterion and thus 
they achieve full diversity with the PIC group decoding of group size 2, i.e., complex-pair-wise 
decoding. It turns out that they may outperform the well-known Alamouti code, Golden code, 
and QOSTBC. 

Appendix A 
Proof of Lemma 1 

Proof: Writing P Ik defined in (15) and an arbitrary matrix P Ik e Vi k in the following 
forms, 

Pi k = [Po,p!,---,P T m -i] T , 
according to the definition of Vi k in (18), we must have that p*, p* e Yj k , % — 0, 1, . . . , m — 1. 
Note that rank (Pj k ) = rank (Pj k ) = dim (VjJ , which implies that all the vectors in Yj k can be 
expressed as linear combinations of p*, i = 0, 1, . . . , m — 1. So there must exist -, < i, j < m, 
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rn—l 
i=0 

or in the matrix form we have P Xk = FP Xk , where the (i, j)-th entry of F is f iy j. So P Xk can 
be viewed as a concatenation of the linear filters P Xk and P. Substituting the above equation 
into (19), we get 

~zi k = F (Pi k G Xk x Xk + P Xk w) = Fz Xk , (46) 

where z Xk = P Xk G Xk x Xk + P Xk w. For an SNR, the optimal decoding of x Xk from z Xk is as 
follows, 



x x , = arg min P (z x , I ) , 
and the optimal decoding of x Xk from z Xk is as follows, 

x Xk = arg min P (Fz Xk \x Xk ) . (47) 
Notice that any filtering may not help an ML decision. Therefore, for an SNR, we have 

P err (P Xfc ,SNR) <P err (P Jfc ,SNR), 
which completes the proof. ■ 

Appendix B 
Proof of Lemma 2 

Proof: Since P is a projection matrix, P can be decomposed as 

P = U"DU, (48) 
where U G C mxm is an unitary matrix and 



D 



I r xr Orxm-r 
Orra— rxr Ora-rxm-r 



, r = rank(P). (49) 



By multiplying both sides of (20) by U to the left, we have 

Uy = UGx + DUw. (50) 
Since the column vectors of G belong to V, G — PG, (50) can be written as 

Uy = UPGx + DUw = DUGx + DUw (51) 
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Note that the effect of multiplying D to the left of a vector is picking up the first r entries and 
setting the rest n — r entries to zero. Hence from (51), we can see that only the first r entries 
of Uy matter and all other entries are zeros. We also have that the first r entries of DUw are 
i.i.d. Gaussian noise since U is unitary, the rest n — r entries are all zeros. Use [v] r to denote 
the vector that contains the first r entries of v e C m . Then, (51) is equivalent to 

[Uy] r = [DUGx] r + [DUw] r . (52) 

Since [DUw] r is a white Gaussian noise, the ML decision is the same as the minimum distance 
decision for (52), i.e., 

x = argmin ||[£A/] r — [-Di/Ga^H 

* eA ' (53) 
= argmin \\Uy — DUGx\\ , 

where the second equality holds because the last n — r entries have no effect on the distance. 
Noting that U is an unitary matrix and G = PG, the above detection is equivalent to 

x = arg min \\U H Uy — U H DUGx\\ = arg min \\y — Gx\\ . (54) 

x<=A n 11 ' 11 xeA n 

Thus, we conclude that the minimum distance decision in this case is equivalent to the maximum 
likelihood decision. ■ 

Appendix C 
proof of Lemma 3 

Proof: For a given h = [h , h±, . . . , /^_i] T , and two symbol vectors x, x e A n with x ^ x, 
the pairwise error probability Ph(x — ► x) with ML receiver is as follows, 

P h (x -> x) =Q (VSNR\\G(h)Ax\\) 

<Q I c- VSNR \^2\hi\ 2 ) \\Ax\ 



(55) 




c 2 • SNR 1 



\hi\ 2 \\Ax\ 



i=0 

where the last inequality is obtained by applying the well-known upper-bound for the Q-function, 

~, x 1 f x2 
Q{x) < - exp I — — 
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By taking expectation over h at both sides of (55), we get 

P(x^x)=£ h {P h (x^x)} 

f ( c 2 -SNR^,, , 
<£/i<expl 2^\hi\ || A 

To evaluate the above expectation, we use 

E h (exp{-a\h\ 2 ) = — !— , h ~ CAf (0, 1) , a > 0, 
1 + a 

and note that the expectation can be taken separately to each hi, which leads to the following 
result, 

~x 1 ( 2 
P(x ^x)<- 



2 V2 + c 2 SNR||A£c| 



nl-l 

< 2i Sm ~ l - 

c 21 \\Ax\\ 

Since Ax e A„4 n and A^4 n is a finite set, there exists a Aa3 such that 

d min = ||Ax || = min{||Aa3|| ,0 ^ Ax e AA n } . 
Hence for any x, x e A n with x ^ x, we always have 

P(cc -> x) < ^ SNR-'. 

(c • C^min) 



The symbol error probability Pser(SNR) is upper-bounded by 



Pser(SNR)< 2 " 1(I ^ |W ^ SNR-', 
i.e., the system achieves the diversity order I. ■ 

Appendix D 
Proof of Lemma 4 

Proof: For a given 7^ h e C and 7^ Aa? e A^4" fc , since the column vectors of £x(/i) 
are linearly independent over AA, G(h)Ax 7^ 0, or 

\\G{h)Ax\\ > 0. (56) 

Now let us consider a fixed Ax e A„4 n and restrict the parameter /i to the unit sphere, i.e., 
\\h\\ = 1. Since the unit sphere is a compact set, from (56), for this Ax there must exist a 
constant CAa; > such that 

\\G(h)Ax\\ > c Ax . (57) 
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For 7^ h G C l , we always have 

1 



h 



G(h)Ax 



G 



h 

\h\ 



Ax 



^ c Aco 



or 



2-1 \ 2 

2 



||G(h)Acc|| > c Ax \\h\\ = c Ax ^2 \ h 

\i=0 

Since AA n is a finite set, we can define c min and rf max so that 

< c min = min {c Aa; , 7^ Ax G AA n } , 
< rf m ax = max {|| Ax || , Ax G AA n } . 



Then 



||G(h)Ax|| > ^2. I Vl^l 2 ) IIAxI 



=0 



5> 



|Ax|| , VAa; G Ai",/iG C', 



.i=0 



where c = t™"-. This completes the proof. 



dm 



(58) 
(59) 

(60) 
(61) 



(62) 



Appendix E 
Proof of Theorem 2 

Proof: Let H = (h itj ) G C ntXHr be the channel matrix as in (1) and h = vec(if). Suppose 
,Y is an STBC that satisfies the full rank criterion, i.e., any matrix 7^ AX G AX is a full rank 
matrix. Write AXAX H into the following decomposition 



AXAX H = UDU H . 



(63) 



where U G C ntXnt is an unitary matrix and D = diag(A , Ai, . . . , A nt _i). Since AX is a full 
rank matrix, A min (AX) = min{A , Ai, . . . , A nt _i} > 0. Note that AX is a finite set, we can 
define A min such that 



A min = min {A min (AX), ^ AX G A*} > 0. 



(64) 
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Hence we have 

|| H AX || 2 = tr (HAXAX H H H ) 
= tr (hUD(HU) h ^ 

>tr (x^HUimjf) (65) 

n r — lnt—l 
1 1 2 

As (7) mentioned in Section II, ||ifAX|| can also be written as 

\\HAX\\ 2 =\\G(h)Ax\\ 2 . (66) 

where Ax E AA n . By (65) and (66), we can see that for h ^ 0, G(h)Ax = if and only if 
Aa? = 0, i.e., the column vectors of G(h) are linearly independent over AA. 

We now prove the necessity. Since X is a linear dispersion code, the scaling invariance (31) 
is satisfied. If the column vectors of G(h) are linearly independent over AA, then according to 
Lemma 4, there exists a constant c > such that 

||#AX|| 2 = tr {HAXAX H H") 

rir-lnt-l (67) 

> c || A£c|| 2 Yl l^il 2 ' WH e £nrXnt - 

Next we prove that the above inequality implies that the eigenvalues of AXAX H are all greater 
than zero for AX ^ 0. The uniqueness from the decodablity of the STBC X tells us that 
AX 7^ implies Ax ^ 0. Consider the decomposition (63) for AX. If there is an eigenvalue 
\ k = 0, then we can find an H e C nrXnt such that 

HU=[0,0,...,v,...,0], (68) 

where the A;-th column vector v e C" r can be arbitrary non-zero vector. The existence of such 
H 7^ is ensured since U is invertible. For the H that satisfies (68), 

tr (tfAXAX^) = tr [HUD (HU) H ^j = 0, (69) 

which contradicts with the inequality in (67). So we have proved that all the eigenvalues of 
AXAX H must satisfy A, > 0, i.e., AX is a full rank matrix. ■ 
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Fig. 1. Coding gain estimation for 9 G [0, ^) 
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Fig. 2. Performance comparison of several coding scheme and their bandwidth efficiencies are all 8 bits/sec/Hz 
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Fig. 3. Performance comparison of several coding schemes for 4 transmit and 3 receive antennas 
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